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(57) ABSTRACT 

A language translator is provided which determines memory 
structure at compile time for a plurality of object classes 
including at least one virtual base class and at least one class 
derived therefrom. At compile time, space for pointers 
(b -pointers) is set aside in each class object that will have a 
base table (b-table) associated therewith. The b-pointers 
point, at run time, to an associated b-table containing 
memory offsets between the base classes of the derived 
class. At run time, constructors construct the class objects, 
starting from the most derived class object and proceeding 
through to the base class object. However, instead of gen- 
erating the virtual tables and associated pointers, as well as 
the adjusting functions, at compile time, the language trans- 
lator generates the code for these operation to be executed at 
run time. Then at run time, a virtual function table is 
generated for the base class. Since the necessary offsels are 
known at run time, all of the adjusting functions, the virtual 
function tables, and the virtual pointers may be generated. 
Thus, the system completes the construction of an object. 

9 Claims, 5 Drawing Sheets 
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SYSTEM AND METHOD FOR GENERATING 
AN OBJECT STRUCTURE AT RUN TIME IN 
AN OBJECT-ORIENTED PROGRAMMING 
LANGUAGE 

FIELD OF THE INVENTION 

The present invention is directed to object oriented com- 
puter programming languages and, in particular, to a com- 
piler which implements virtual inheritance in object oriented 
programs. 

BACKGROUND OF THE INVENTION 

Object oriented computer programming (OOP) tech- 
niques for facilitating the development of complex computer 
programs are well-known and widely used. As understood 
by those skilled in the art, these techniques involve the 
definition, creation, use and destruction of "objects." These 
objects are software entities including both data elements 
and functions which manipulate the data elements. The data 
and related functions are treated by the software as an entity 
that can be created, used and deleted as if it were a single 
item. Together, the data and functions enable objects to 
model any real world entity in terms of its characteristics, 
which can be represented by the data elements, and its 
behavior, which can be represented by its data manipulation 
functions. In this way, objects can model concrete things, as 
well as abstract concepts, such as numbers or geometrical 
designs. 

In an OOP programming language, objects are defined by 
creating "classes," which are not objects themselves, but act 
as templates that instruct the compiler how to construct 
actual objects which are "instances" of the classes. For 
example, a class may specify the number and type of data 
variables and the steps involved in the functions which 
manipulate the data. A corresponding object is actually 
created by a special function called a "constructor". The 
constructor uses the corresponding class definition and addi- 
tional information, such as arguments specified during 
object creation, to create an object. Similarly, objects are 
destroyed by a special function called a "destructor" when 
the objects are no longer of use. 

The principle benefits of OOP techniques arise out of 
three basic characteristics: encapsulation; polymorphism; 
and inheritance. Data encapsulation refers to the binding of 45 
data and related functions. More specifically, an object can 
be designed to "hide" (or "encapsulate"), all or a portion of 
its internal data structure and corresponding internal func- 
tions. For instance, during program design, a program devel- 
oper can define objects in which all or same of the data 
variables and all or some of the related functions are 
considered "private" or for use by only the object itself. 
Other data or functions can be declared "public" or available 
for use externally of the object. External access to private 
functions or data can be controlled by defining public 
functions for an object which can be invoked externally of 
the object. The public functions form a controlled and 
consistent interface between the private data and the outside 
world. Any attempt to write program code which directly 
accesses the private functions or data causes the compiler to 
generate an error message during compilation and stop the 
compilation process. 

Polymorphism is a characteristic which allows multiple 
functions that have the same overall purpose, but that work 
with different data, to produce consistent results. Inheritance 
allows program developers to easily reuse preexisting func- 
tions and to reduce the need for creating redundant functions 
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from scratch. The principles of inheritance allow a software 
developer to declare classes (and the objects which are later 
created from them) as related. Specifically, classes may be 
designated as derived classes of other base classes. A derived 
class inherits and has access to functions of its base classes 
just as if these functions appeared in the derived class. 
Alternatively, a derived class can override or modify an 
inherited function merely by defining a new function with 
the same name. Overriding or modifying does not alter the 
function in the base class, but merely modifies the use of the 
function in the derived class. The creation of a new derived 
class which has some of the functionality (with selective 
modification) of another class allows software developers to 
easily customize existing code to meet their particular needs. 

One widely used and well known OOP language is C++. 
The C++ language is classified as a hybrid OOP language, 
as opposed to a pure or orthodox OOP language. Because the 
C++ language was designed as an improvement to and as an 
extension of C, it is full of the traditional features of ANSI 
C. C++ source code is usually compiled before being 
executed. Therefore, the C++ programming process entails 
a development cycle of editing, compiling, linking, and 
running. Although the iteration through the cycle is a slow 
process, the produced code is very fast and efficient. The 
C++ language provides an excellent balance between power 
of expression, run time speed, and memory requirements. 
C++ compilers are commercially available from several 
vendors. 

Inheritance may provide the most power to the class 
concept in OOP. Inheritance allows classes to be continually 
built and extended with essentially no limit. C++ is different 
from some OOP languages because it allows multiple inher- 
itance. 

To illustrate the concept of virtual inheritance, reference 
will be made to the class inheritance trees in FIGS. 1A and 
IB. In FIG. 1A, class D directly descends from both base 
classes B and C and indirectly descends from class A. In this 
example, class D might appear to a compiler to have two 
distinct Aclasses appearing as base classes. Having multiple 
copies of the same base class in an inheritance tree in the 
compiled program is confusing and wastes storage space. To 
solve this problem, a base class may be declared to be virtual 
so that the compiler is directed to share a single copy of a 
given base class object in the derived class objects. A class 
inheritance tree using class A as a virtual base class is 
illustrated in FIG. IB. Virtual inheritance, i.e. inheritance 
from a virtual base class, is a primary strength for improving 
space and run time efficiency of the C++ object model. FIG. 
1C shows the resulting complete class D 10 corresponding 
to the inheritance tree of FIG. 1A, in which the base class A 
is not virtual. FIG. ID shows the resulting complete class D 
15 corresponding to the inheritance tree of FIG. IB, in which 
the base class A is virtual. As shown in FIG. ID, virtual base 
classes are only shared within a complete object, in this case 
the complete object D 15. Also shown in FIG. ID is the 
virtual function table 16 for object D 15, indicating a virtual 
function 17, for example, contained within the virtual base 
class A. 

To use virtual inheritance in a C++ program, the program- 
mer must specify one or more of a class's functions to be 
virtual. Typically, the complete set of virtual functions 
available is fixed at compile time and a programmer there- 
fore cannot add or replace any function of the complete set 
at run time. Accordingly, fast dispatch of virtual function 
invocations is realized at the cost of run time flexibility. 
Virtual function calls are generally resolved by indexing into 
a table (conventionally known as a virtual function table) 
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constructed by the compiler, which holds the addresses of 
the virtual functions associated with the base class. A 
fundamental problem of virtual inheritance is to dispatch 
within the constraints of the C++ object model conventions, 
the virtual functions at run time with the correct object 
pointer for the object that is being processed. 

More specifically, this problem relates to properly obtain- 
ing a pointer which points to a derived class when given a 
pointer to a virtual base class. In the C++ language such a 
pointer is referred to as a "this" pointer. The "this" pointer 
must point to a location in the base class object that contains 
the function. As a result, adjusting functions are used to 
obtain a new "this" pointer pointing to the derived class from 
a "this" pointer pointing to a virtual base class. However, it 
is difficult to correctly obtain the new "this" pointer because 
the virtual function may be shared by many interrelated 
classes having different class structures derived from the 
virtual base class. 

A simple illustration for these terms is provided in FIG. 
2A. Class A is the virtual base class for derived classes B, C, 
D, and E. Thereby, class D is a derived class of class B such 
that class A is a virtual base class to class B and class B is 
a virtual base class to class D. Also, class E is a derived class 
of class B, and also of class C. Because class A has been 
declared a virtual base class by the programmer, virtual 
function table pointers 210 are formed in class A which point 
to a virtual function table 220 associated therewith. The 
virtual function table 220 contains addresses corresponding 
to the functions 230 and 240 associated with class A. When 
the memory structure for the data structure of the virtual 
base class A is determined at compile time, memory space 
is set aside in class A for the virtual function table pointers 
210, which will be initialized to point to the virtual function 
table 220, which in turn addresses the functions 230 and 240. 
The virtual function table 220 is used at run time to invoke 
the functions 230 and 240 associated with class A As a 
result of virtual inheritance, the functions 230 and 240 may 
be shared by many different classes (in the present example 
these virtual function tables are shared by classes B, C, D, 
and E). However, in general, the virtual function table 220 
and adjusting functions 250 and 260 may be different for 
each object of classes A, B, C, D or E. 

At run time, when an object of class A has one of its 
functions called, and when that function is overridden within 
a derived class in the object, then a "this" pointer which is 
passed to that overriding functioo must be obtained from 
information available via the "this" pointer of the base class. 
For example, in FIG. 2 A, if the function 230 is overridden 
in class B, then a call starting in class A must find a "this" 
pointer for class B (i.e., a "this" pointer 270) from infor- 
mation available via the "this" pointer 200. 

In FIG. 2A, the adjusting functions 250 and 260 are 
shown which provide the adjustment of the "this" pointer 
200. These adjusting functions 250 and 260 are small 
"assembly stubs" that obtain ihe correct "this" pointer 270 
for the call to the function 230 or 240 by offsetting from the 
available "this" pointer 200, based on the actual layout of the 
complete object in memory. The adjusting functions 250 and 
260 allow for the entries in the virtual function table 220 to 
remain simple pointers. The address contained within each 
entry of the virtual function table 220 directly addresses a 
function 230 or 240 when no adjustment is necessary; but 
the address addresses an associated adjusting function 250 
or 260 when an adjustment of the "this" pointer 200 is 
necessary. 

Such an implementation of adjusting functions solves the 
aforementioned offset problem if the adjusting functions are 
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constructed at compile time but creates a compatibility 
problem. In FIG. 2A for example, there are two different 
offsets between base class objects B and A that are depen- 
dent upon the configuration of the complete class D or E in 

5 memory. Different offsets may be necessary for the same 
class, as shown for class B in this example. Therefore, the 
correct adjusting functions for intermediate classes (classes 
having at least one virtual base class and being derived by 
at least one other class, such as class B in the present 

1Q example) cannot be uniquely determined for cases where 
entry into a function is effected at the intermediate class. 

FIG. 2B shows example memory layouts for instantia- 
tions of the complete objects E and D as shown in FIG. 2A. 
A first memory layout 400 for object Ej is shown having an 
offset 408 between class A and class B. A second memory 

15 layout E for object E 2 is shown having an offset 410 betweeo 
class A and B, and a third memory layout for object E 3 404 
is shown having an offset 412 between class A and class B. 
A memory layout 406 for object D is shown having an offset 
414 between class A and class B. Thus FIG. 2B shows 

20 several possible offsets between class A and class B. 

In one presently used solution to this problem, the adjust- 
ing functions for all of the possible class instantiations are 
built at compile time. As the classes are analyzed at compile 
time, all of the possible class offsets are determined then 

25 stored in a table. This table is accessed at run time to obtain 
the offset information during construction of objects. 
However, an extra parameter (the extra parameter being a 
table which points to another table having the locations of 
the adjusting functions) must be included in the object 

30 model used by this solution. As a result, this solution is 
incompatible with existing object models because this extra 
parameter will not be recognized by the compilers which 
follow the design conventions as suggested in the C++ 
annotated reference manual. Also because the adjusting 

35 functions are stored in object files on the system's disk they 
must be brought off the disk and into memory, which is a 
relatively slow operation. Accordingly, this solution has the 
additional drawback of slowing down the speed of the 
compiled program at run time. 

40 In another presently used solution, the adjusting functions 
are built at compile time by assuming that the class to be 
constructed is not a base class. Because it is not known how 
intermediate classes will be constructed at run time, an 
additional offset is provided for objects created from inter- 

45 mediate classes. Unfortunately, this offset does not work for 
all circumstances (e.g. cases having multiple interrelated 
base classes). 

This problem of correctly calling virtual functions is 
recognized throughout the industry as an important problem 

50 to solve in facilitating the use of virtual inheritance in OOP 
languages as is readily seen by the large number of proposed 
attempts, which to date fail to completely solve this prob- 
lem. In fact, a solution which provides the proper conversion 
for all class configurations including cases having multiple 

55 virtual base classes and virtual functions has yet to be 
implemented within the conventional object models. 
Therefore, a solution to this problem is desired that will 
always obtain a correct "this" pointer and is compatible with 
the existing object models. 

60 SUMMARY OF THE INVENTION 

It is therefore an object of the present invention to 
generate the correct "this" pointer to a derived object class 
when a virtual function is invoked on a base class object. 
65 It is also another object of the present invention to 
generate a "this" pointer in a manner which is compatible 
with the existing object model used by a compiler. 



09/20/2003, EAST Version: 1.04.0000 



US 6,446,259 B2 



In accordance with the invention, the virtual function 
tables and adjusting functions are generated for some base 
classes at run time, when the offsets from the base classes to 
their derived classes are known. In particular, an object data 
structure is provided by a language translator, such as a 5 
compiler, which determines the memory structure at compile 
time for a plurality of object classes including at least one 
base class and at least one class derived therefrom. At 
compile time, space for pointers (b-pointers) is set aside in 
each base class object that will have a base table (b-table) to 
associated therewith. The b-pointers point at run time to 
their associated b-table, which must contain memory offsets 
between the base class objects within the derived class 
object. At runtime, constructors construct the class objects, 

starting from the most derived class objects and proceeding is ^ n ^mpa^ble^with c^ln/^ mocteb! 
through to the mner base class object. 

However, instead of generating the virtual function tables 
and associated pointers, as well as the adjusting functions, at 
compile time, the compiler generates the code that will do 
the generation at run time. Then at run time, a virtual 20 
function table is generated for the base class. Since the 
correct offsets are known from the contents of the tables at 
this time, all of the adjusting functions, the virtual function 
b-tables, and the virtual pointers may be generated correctly. 
Thus, the system completes the construction of an object. 25 

The adjusting functions, virtual function table, and virtual 
pointers for the most derived class may, of course, be 
generated at compile time as before. However, if they are 
generated at run time, the compiler is able to operate at a 



342. [n this example, class B is a base class for both classes 
D and E. However, the objects of this class within objects of 
classes D and E will have different offsets from the virtual 
base class A. At the end of compile time, the b-table 332 for 
class D will include the class offsets for its base classes B 
and A and the b-table 312 for class B will include the class 
offsets for class A, its only base class. Similarly, the b-table 
342 for class E will include the class offsets for its base 
classes B, C, and A, and the b-table 322 for class C will 
include the offset for class A. 

The present invention utilizes the fact that this base class 
offset information is available at run time when an object is 
being constructed. Specifically, this base class offset infor- 
mation is obtainable to generate the adjusting functions 
without passing any new or extra parameters so that this 



With reference to FIG. 2A, the correct base class offset 
from derived class B to base class A will vary because the 
base class offset may be affected by class C being put 
between classes A and B in a class E object but not in a class 
D object. For discussion purposes, it is assumed that the 
object is class B within the complete object of class D and 
has a base class offset entered in the b-table 312. At run time, 
when the constructor for the virtual base class A is called by 
the constructor for its derived class B during recursive 
construction, the base class offset between A and B is made 
available to base class object A by means of the adjusting 
functions 250 and 260. The virtual function pointers 210, the 
virtual function table 220 and the adjusting functions 250 
and 260 may be generated at run time, because the correct 



faster speed than in the conventional techniques which build 30 base class offset is known to the constructor for class B 



these functions, tables, and pointers at compile time. In other 
words, the time necessary to generate these functions, tables 
and pointers at run time is less than the time necessary to 
retrieve them from the system's disk if generated at compile 
time. 

In addition, the generating process for the virtual function 
tables and the adjusting functions may share identical virtual 
function tables and adjusting functions to further reduce 
runtime overhead. 

BRIEF DESCRIPTION OF THE DRAWINGS 40 

The invention description below refers to the accompa- 
nying drawings, of which: 

FIGS. lAand IB illustrate virtual class inheritance trees; 

FIG. 1C and FIG. ID show the resulting complete classes *5 
corresponding to FIG. lAand FIG. IB respectively; 

FIG. 2A is a block diagram of an object structure using 
OOP techniques; 

FIG. 2B shows example memory layouts for the complete 
objects E and D as shown in FIG. 2A; and 

FIG. 3 is a flowchart of a method for generating an object 
according to an embodiment of the present invention. 



DETAILED DESCRIPTION OF THE 
INVENTION 

A description of how b-tables are used will be provided 
with reference to FIG. 2A. Each of the derived classes B, C, 
D, and E include base table pointers (or b-pointers) 310, 320, 
330, and 340 which point to their own b-tables 312, 322, 
332, and 342 respectively. Class A will not have any 
b-pointers or a b-table because it is not derived from any 
other class. When the classes are compiled, each of the 
b-tables 312, 322, 332, and 342 is generated containing a 
base class offset from the corresponding derived class 



because the base class offset is in the b-table 312 for base 
class B object. 

When the function 230 is called for the object in the 
present example, the "this" pointer value pointer used will 
35 be the "this" pointer 200 pointed to the base class A object. 
The virtual function pointers 210 will point to the virtual 
function table 220 (which has just been generated during run 
time). This table will in turn contain a pointer to the 
adjusting function 250 (which has also been generated at run 
w time). The adjusting function 250 will first take the "this" 
pointer pointing at location 200 and use it to obtain the "this" 
pointer pointing at the derived class B (corresponding to the 
location as shown by 270) and then call the adjusting 
function 230, resulting in the correct function being called 
with the correct "this" pointer. 

An embodiment of the present invention will be described 
with reference to the flow chart of FIG. 3. In the first step 
510, the compiler determines the offsets of data and pointers 
within classes, and the placement of base classes within 
complete classes. Also during step 510, the compiler gen- 
50 e rates the instructions and data for the computer to execute 
during the later steps. Included in these data and instructions 
are any virtual function tables, b-tables, and adjusting func- 
tions that the compiler determines are both possible and 
desirable to include. 

Next at step 520, the space for an object is allocated 
during run time. At step 530, the b-pointers and the b-tables 
are filled with their base class offsets. At this point, the 
object is entirely laid out with the b-pointers and b-tables 
providing a foundation so that data may be accessed. 

After the b-pointers of the object have been filled in, the 
constructors of each of the virtual base class objects are 
called at step 540. 

The program then determines at step 550 whether the 
adjusting functions and the virtual function table for each of 



55 



60 



objects to each of its base class objects, and the b-pointers 65 the virtual base classes are already available, having been 
310, 320, 330, and 340 are set aside in each of the derived made at step 510 or at an earlier invocation of step 560. If 
classes to point to each of the b-tables 312, 322, 332, and the adjusting functions and the virtual function table were 
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not previously generated for a virtual base class, the con- 
structor generates the adjusting functions and the virtual 
function table at step 560 by generating the appropriate 
instructions using information for the b-table and other 
sources. Then, the virtual function table pointer of the class 5 
object is pointed at the appropriate virtual function table at 
step 570. 

In addition, according to another aspect of the present 
invention, resource sharing techniques can be used when 
generating the virtual function tables and the adjusting 10 
functions, so as to reduce the amount of memory space 
utilized. When generating a virtual function table or an 
adjusting function at run time, the program determines 
whether an adjusting function with the identical offset and 
function has already been generated or whether a virtual 
function table having the same addresses has already been 
generated. If an identical one has been generated before, it 
can be shared so that memory optimization is realized. If an 
identical one has not already been generated, then the new 
function or table is simply generated. 

According to another aspect of the present invention, the 
adjusting functions and virtual function tables for some 
objects can be generated at compile time when the correct 
offsets are known. For example, classes D and E are the most 
derived classes in FIG. 2A and the adjusting functions for 
these can be generated at compile time. Then when a class 
object is constructed at run-time the remaining virtual func- 
tion tables and adjusting functions can be generated. 
However, it may take more time to retrieve these functions 
and tables from storage at run time than to actually generate 
these functions and tables at run time. Sharing optimizations 30 
can again be realized for the compile-time generated adjust- 
ing functions and virtual function tables by using the same 
techniques as described above. 

The foregoing description has been directed to specific 
embodiments of the invention. It will be apparent, however, 35 
that other variations and modifications may be made to the 
described embodiments, with the attainment of some or all 
of their advantages. Therefore, it is the object of the 
appended claims to cover all such variations and modifica- 
tions as come within the true spirit and scope of the 40 
invention. 

What is claimed is: 

1. A method for generating an object data structure, 
comprising the steps of: 



25 



3. A language translator comprising: 

a base class offset determiner, operable to determine a 
base class offset between a virtual base class and a 
derived class, said derived class being derived from 
said virtual base class; 

a base table generator, operable to generate a base table 
for said derived class, said base table including said 
base class offset; and 

a program code generator for generating computer pro- 
gram code for being executed at run time to create at 
run time a virtual function table and an adjusting 
function for said virtual base class, the virtual function 
table containing a pointer to said adjusting function, 

said adjusting function being operable to generate another 
pointer, based upon contents of the base table, to the 
derived object when executed. 

4. A language translator according to claim 3, further 
comprising: 

a virtual table generator operable to generate virtual tables 
and adjusting functions for the most derived class at 
compile time; and wherein 

the program code generator generates program code for 
being executed at run time to generate the virtual tables 
and adjusting functions for the remaining classes at run 
time. 

5. A computer program product including a computer 
readable medium having executable instructions represent- 
ing a computer program recorded thereon, said executable 
instructions comprising: 

program code for determining a base class ofiset between 
a virtual base class and a derived class, said derived 
class being derived from said virtual base class; 

program code for generating a base table for said derived 
class, said base table including said base class offset; 
and 

program code for generating other program code for being 
executed at run time to create at run time a virtual 
function table and an adjusting function for said virtual 
base class, said adjusting function being operable to 
generate a pointer, based upon contents of the base 
table, to the derived class object when executed, the 
virtual function table containing another pointer to the 
adjusting function. 

6. A computer program product according to claim 4, said 



, t , , a- t , t , e executable instructions further comprising: 

determining a base class offset between a virtual base 45 . . , r, „ „. 

program code for generating virtual tables and adjusting 

functions for the most derived class at compile time; 

and wherein 
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class and a derived class, said derived class derived 
from said virtual base class; 
generating a base table for said derived class, said base 

table including said base class offset; 
using a compiler to generate computer program code for 
being executed at run time to create at run time a virtual 
function table for said base class and all adjusting 
functions for said virtual base class, the virtual function 
table containing pointers to said adjusting functions; 
and 

at run time, executing said program code to create said 

virtual function table and said adjusting functions, said 

adjusting functions being operable to generate other 

pointer to the derived class based upon contents of said , n * ie< ? lD J ai * s . °^ T f Die 

base table, when executed. denved ob ^ 1 dass " 

2. A method according to claim 1, further comprising the 
steps of: 

generating virtual function tables amd adjusting functions 

for a derived class at compile time; and 
generating virtual tables and adjusting functions for the 

remaining classes at run time. 



the other program code is also for generating virtual tables 
and adjusting functions for the remaining classes at run 
time. 

7. A method according to claim 1, further comprising, 
prior to generating said adjusting function, determining 
whether another adjusting function already has been gener- 
ated which is operable to generate an identical pointer to the 
derived class object 

8. A language translator according to claim 3, wherein 
said program code is also for determining, when executed at 
run tune, whether another adjusting function has been cre- 
ated that is operable to generate an identical pointer to the 



9. A computer program product according to claim 4, 
wherein the other program code is also for determining, 
when executed at run time, whether another adjusting func- 
tion has been created that is operable to generate an identical 
65 pointer to the derived object class. 
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